A New Approach Based on Feature Selection of Light Gradient Boosting Machine and Transformer to Predict circRNA-Disease Associations
نویسندگان
چکیده
Circular RNA (circRNA) is a type of single-stranded with closed circular structure. Recent studies have shown that circRNA has relatively more stable structure than its linear counterparts. The become biological marker in medicine and plays crucial role disease prediction. However, traditional experiments are often time-consuming laborious. More researchers taking computational approaches to predict the circRNA-disease associations rapidly reliably. In this paper, we propose novel method for predicting based on feature selection using Light Gradient Boosting Machine (LightGBM) self-attention neural network-Transformer (LGFRCDA). Firstly, histogram-based decision tree algorithm LightGBM used discretize continuous floating-point features into histogram integer numbers. While traversing samples, difference between histograms optimize calculation, greatly improving construction speed. Then leaf-wise employed calculate node maximum split gain, resulting final vector. Finally, these sorted order importance introduced Transformer information fusion Our study demonstrates after processing dimension reduction, LGFRCDA achieved prediction accuracy 95.44% AUC (Area Under receiver operating characteristic Curve), which 3.11% higher latest algorithms same dataset. We also conducted search published literature cross-validate predicted result. Out top 15 pairs by model, 13 were confirmed existing literature. These results indicate proposed model suitable can provide reliable candidates experiments.
منابع مشابه
Modeling and design of a diagnostic and screening algorithm based on hybrid feature selection-enabled linear support vector machine classification
Background: In the current study, a hybrid feature selection approach involving filter and wrapper methods is applied to some bioscience databases with various records, attributes and classes; hence, this strategy enjoys the advantages of both methods such as fast execution, generality, and accuracy. The purpose is diagnosing of the disease status and estimating of the patient survival. Method...
متن کاملIFSB-ReliefF: A New Instance and Feature Selection Algorithm Based on ReliefF
Increasing the use of Internet and some phenomena such as sensor networks has led to an unnecessary increasing the volume of information. Though it has many benefits, it causes problems such as storage space requirements and better processors, as well as data refinement to remove unnecessary data. Data reduction methods provide ways to select useful data from a large amount of duplicate, incomp...
متن کاملBlind Quantitative Steganalysis Based on Feature Fusion and Gradient Boosting
Blind quantitative steganalysis is about revealing more details about hidden information without any prior knowledge of steganograghy. Machine learning can be used to estimate some properties of hidden message for blind quantitative steganalysis. We propose a quantitative steganalysis method based on fusion of different steganalysis features and the estimator relies on gradient boosting. Experi...
متن کاملiranian english learners’ perception and personality: a dual approach to investigating influential factors on willingness to communicate
abstract previous studies on willingness to communicate (wtc) have shown the influence of many individual or situational factors on students’ tendency to engage in classroom communication, in which wtc has been viewed either at the trait-level or situational level. however, due to the complexity of the notion of willingness to communicate, the present study suggests that these two strands are ...
task-based language teaching in iran: a mixed study through constructing and validating a new questionnaire based on theoretical, sociocultural, and educational frameworks
جنبه های گوناگونی از زندگی در ایران را از جمله سبک زندگی، علم و امکانات فنی و تکنولوژیکی می توان کم یا بیش وارداتی در نظر گرفت. زبان انگلیسی و روش تدریس آن نیز از این قاعده مثتسنی نیست. با این حال گاهی سوال پیش می آید که آیا یک روش خاص با زیر ساخت های نظری، فرهنگی اجتماعی و آموزشی جامعه ایرانی سازگاری دارد یا خیر. این تحقیق بر اساس روش های ترکیبی انجام شده است.پرسش نامه ای نیز برای زبان آموزان ...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3275967